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Abstract 

We demonstrate diversification rather than optimisation for highly interacting organisms in a 
well mixed biological system by means of a simple model and reference to experiment, and find 
the cause to be the complex network of interactions formed, allowing species less well adapted 
to an environment to flourish by co-interaction over the 'best' species. This diversification can be 
considered as the construction of many co-evolutionary niches by the network of interactions between 
species. Evidence for this comes from work with the bacteria Escherichia coli, which may coexist with 
their own mutants under certain conditions. Diversification only occurs above a certain threshold 
interaction strength, below which competitive exclusion occurs. 

1 Introduction 



Understanding how diversity arises through evolution and is sustained in an ecosystem is an important 
issue. For some ecosystems, as seen in tropical rain forests, the diversity is rather high; it is quite low in 
other ecosystems. One of the key questions therein is whether interactions between organisms enhance or 
suppress diversity. If there is no explicit symbiotic interaction, it would be expected that the competition 
for a given resource leads to exclusion of many types, leading to the monodominance, i.e., the survival 
of the fittest, as determined by Gause's competitive exclusion principle pQ. In contrast, in a recent 
experiment by Kashiwagi et al. [2J, it was observed that diversification of several types of Escherichia 
coli (or e. coli) are facilitated under stronger interaction, for a well-mixed liquid culture with a given 
nutrition source. In the experiment the sole nutrition source (of nitrogen) in the culture is glutamate, 
and through mutagenesis, evolution of a single gene was studied - the gene for glutamine synthetase 
production, which synthesises glutamine from glutamate. Since the glutamine synthesis is necessary for 
the growth of the bacteria in this experiment, those with the higher activity of glutamine synthetase 
will result in faster growth of the bacteria. Indeed, in a low population density condition, only the 
fittest, i.e., those with highest enzyme activity survives. However, in a dense condition, multiple types 
including those with much lower enzyme activity coexist. Kashiwagi et al. have also confirmed [2] that 
the interaction is essential to coexistence of several types by cutting off a chemical in the medium that 
is supposed to be relevant to interaction, and showing the recovery of the survival of only the fittest. 

Motivated by this experimental result, we show that the diversification is indeed facilitated by the 
interaction. We do this by adopting a slightly modified version of the Tangled Nature (TaNa) model 
010 E]- In addition to the standard, interspecial interaction in the TaNa model, we introduce differing 
values of self-interaction for each genotype. For comparison to the e.coli experiment of Kashiwagi et al., 
this can be thought of as glutamine synthetase efficiency. A self-supporting, dominant genotype may 
coexist with, or be displaced by, a number of other genotypes that are less efficient competitors for the 
resource individually, provided that strong enough interactions are permitted. Diversity is maintained 
via the complex network of interactions. 

The TaNa model is a simple individual based model, emphasising co-evolutionary aspects in ecology. 
In particular, diversity occurs in the absence of space and without assumptions about the neutrality of 
species [SI E] , which although give very good matches with species abundance data jTJJj cannot answer 
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broader questions about why such accuracy is achieved, as species are observed to interact strongly. 
There are clearly aspects of evolution that require the specifics to be considered, and the key is finding 
an appropriate level of description. The diversification observed in e.coli is describable at the general 
interaction level of our model but not below, and although more specific models can be considered it is 
not necessary to do so for an understanding in general terms of the relationship between diversity and 
interaction. 



2 Definition of the Model 

We now define the Tangled Nature model. Individuals are represented as a vector S a = (5f , S%, ■ <S£) 
in genotype space S. The Sf take the values ±1, and we use L = 20 throughout. Each S string 
represents an entire species with unique, uncorrelated interactions. The small value of L is necessary for 
computational reasons as all organisms exist in potentia and have a designated interaction with all other 
species 1 . 

We refer to individuals by Greek letters a, /?,... = 1, 2, N(t). Points in genotype space are referred 
to as S a , S fc , and any number of individuals may belong to a point in genotype space S a . 

In the original TaNa model, individuals a are chosen randomly and allowed to reproduce with prob- 
ability p ff. 

exp[g(S",t)] 

Poff{S '* )= l + cxp[/f(S°t)] ( ' } ( ) 

They are then killed with probability pkm, which is a constant parameter. The difference between 
the original model and the one used here is the definition of the weight function H(S a , t). The original 
version used was: 

Ho(S a , t) = JL- J(S Q , S)n(S, t) - pN(t) (2) 

Where k (= 1/c from previous papers) is a control parameter, N(t) is the total number of individuals 
at time t and n(S, t) is the number of individuals with genotype S at that point. The interaction matrix 
J(S Q ,S) represents all possible couplings between all genotypes, each generated randomly in the range 
(—1,1), being non-zero with probability 0. Since the functional form of J(S a ,S b ) does not affect the 
dynamics, provided that it is non-symmetric with mean 0, we choose a form of the interaction matrix that 
speeds computation 0j. In the analysis sections, we will use shorthand versions: J a {, as the interaction 
of an individual from species b on an individual from species a, and n a as the number of individuals with 
genotype a. 

The new model also allows a self-interaction term, giving the full weight function for an individual a: 

H(S a , t) = H (S a , t) + j^n(S a , t)E(S a ) (3) 

Here, e is a new control parameter and E(S a ) is the self-interaction of individual a, generated 
uniformly random from the range (0,1). All members of the same species will have the same weight 
function and therefore the same offspring probability at a given time; i.e. if a was from species a then 
H(S a ) = H(S a ). 

To understand the meaning of the self-interaction term, we consider the weight function of a system 
with only one species a, H(S a ) = eE(S a ) — /in(S a ) since N — n(S a ). If we assume that the system is 
in a steady state (P ff = Pkm), then H(S a ) = H* = — ln(^i_. — 1), which is constant. Thus we find 

1 When discussing the model, we refer to points in genotype space as species. It is a matter of interpretation whether we 
consider genotype space to be 'coarse-grained' (resulting in each being a different species - valid when k and e are 'large' 
so that genotype differences affect reproduction probability greatly; see Equation |2]for definitions), or whether we consider 
genotype space to be a small sample of a much larger space (constant for all considered individuals), meaning genotypes 
are types of a base species (which would be valid when k and e are small, and so all genotypes have similar reproduction 
probabilities). As we operate in neither extreme and reproduction is asexual, the distinction between species and type is 
difficult. 
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n(S a ) — eE ^ S ^ H ■ , meaning that E(S a ) determines how successful species S a would be if alone in the 
system. Hence consideration for negative E(S a ) is not necessary. 

Reproduction occurs asexually, and on a successful reproduction attempt two daughter organisms 
replace the parent, with each S" mutated (flipped from 1 to -1, or from -1 to 1) with probability 
Vmut- Thus mutations are equivalent to moving to an adjacent corner of the L-dimcnsional hypercube 
in genotype space, as discussed in @|. 

A time-step consists of choosing an individual 2 a randomly, and processing according to: 

• a is allowed to reproduce with probability Poff- 

• a is killed with probability puui ■ (if ot reproduced, it is a daughter organism that is killed) . 

We define a generation as the amount of time for all individuals to have been killed, on average, 
once. For a stable population size, this is also the time for all individuals to have reproduced once, on 
average. The diversity is defined as the number of genotypes with occupancy greater than 20 to eliminate 
unsuccessful mutants from our count, and is called the wild type diversity. 

Unless otherwise stated, the parameters used will be: O = 0.2, fj, = 0.01, p m ut = 0.015, e = 2.0 and 
Pkui =0.1; see 0] for more details. The initial conditions were determined by allowing the system to find 
a monodominant q-ESS by running the system for 5000 generations with all interaction disabled (fc = 0), 
thus the best competitor in the initial set was found. Then the interaction was enabled by setting k to 
the desired value. 



3 Results 

3.1 Observed behaviour 

As in the basic Tangled Nature model, the system experiences a number of quasi-Evolutionary Stable 
Strategies (or q-ESSs) during which a single genotype or set of genotypes is present with constant 
occupancy, ignoring fluctuations. The q-ESS may end abruptly, leading to a disorder phase before a new 
q-ESS is found, which may or may not contain some of the same genotypes. For the parameter ranges 
we study, the disorder phase lasts only tens of generations and so is instantaneous on an evolutionary 
timescale. This behaviour is shown in Figure ^ with some major events labelled. 
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Figure 1: A sample run (k — 10) showing all genotypes with occupations greater than 20 as an unordered genotype label. 
Times shown correspond to different cross-over types, (a) is from the original monodominance to a diverse state, which 100 
generations later becomes more diverse again, (b) shows a cross-over from one diverse state to another, which at (c) becomes a 
new monodominant state. Then at (d) the system returns to a new diverse state. 



We are interested in what happens as we change the interaction strength. As shown in Figure 
around k w 5.5, a change from monodominance to the possibility of diverse systems occurs. At low k 

2 In previous versions a different individual was chosen for reproduction and killing actions. Here we select only one 
individual and process it for reproduction and killing for code efficiency reasons - above the level of fluctuations the two 
methods are equivalent. 
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values, a large number of mutant species are examined by the system, but their numbers are suppressed 
by lack of positive interactions - if another species is successful then the original strain must become 
extinct. At higher values of k average diversity increases, with the possibility of both monodominancc 
and diverse states at different times as shown in the example, Figure H] 




5 10 15 20 25 30 
inter- species interaction strength k 



Fig Ure L\ k dependence of the average diversity of an evolved system, taken for 40000 — 50000 generations and 500 runs per 
data point. 



The appearance of diversity is clear when considering the ratio of co-interaction strength to self- 
interaction strength R =< k " 3 > shown in Figure At k « 5.55, R = 1, meaning that the 
co-interaction strength is greater than the self interaction strength for k > 5.55. For k > 10, R ~ k as 
co-interaction becomes the dominant driving force and selection acts to maintain positive interactions. 
Above this k, the individual terms in ^ JijUj and eEiUi are maximised by selection and therefore become 
independent of k (again ignoring fluctuations). For k < 5.55, interactions do not contribute to fitness 
of the wildtype. For k G (5.55, 10), the relative importance of the two driving forces changes. Below 
k = 5.55 diverse q-ESS states are not found 3 as co-interaction is always weaker than self-interaction 
leading to monodominance. 
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inter-species interaction strength k 



Figure 3: Average value of the ratio of the relative interaction strengths R as a function of fc, approximately a straight line for 
k > 8. 



As soon as it is possible to survive via co-interactions, the selection pressure on self-interaction 
becomes weaker. Although it is still beneficial to have the highest self-interaction possible, the additional 
variable of co-operation pulls species away from the maximally self-interacting peak. 

3 The apparent non-unity diversity below the threshold value is due to occasional mutant fluctuations above the wildtype 
threshold chosen. 
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An interesting feature of Figure ^ is that the number of q-ESS switches is high. At lower interaction 
strengths, monodominant q-ESSs tend to remain for the entire run, with a low possibility of a switch to 
another monodominant q-ESS with better self-interaction properties. As interaction strength increases, 
the number of q-ESS switches also increases leading to a greater rate of exploration in genotype space. 



3.2 Explanation for a cross-over in diversity 

We can show the existence off a threshold k value by a simple argument from the definition of H. 
Consider a system containing only one species, a, with genotype label S a , and an individual a from that 
species. Then for all individuals, H~ = H~ = eE a — fin a , as there is no other species present to interact 
with. We now consider adding an individual (3 of species b, thus N = (n a + n b ) ps n a 3> n b , and so: 



= J^L_ + eE aJ -^— M (n a + n b ) (4) 
(n a + n b ) (n a + n b ) 

w eE a - [m a (5) 



Similarly for (3 from species b: 



+ = kn^ha + tEb _^_ _ Kna + nb) (6) 
(n a + n b ) {n a + n b ) 

« k J ba - i±n a (7) 

For species b to be able to invade the system, we require that > vZfp or equivalently H b > H+. 
This can be rewritten as: 

* > ^ ( 8 ) 

•ha 

To prevent b taking over the system and forming a new monodominant state, we also require that in 
the limit of large n bl species a does not die out completely. Thus the reverse formula must also be true: 

k>^ (9) 

Jah 

E a has been selected to be high initially, as a was successful on its own; similarly J ba will be selected 
to be high to ensure b can proliferate. Thus the threshold fc (called k m i n ) is positive (and from Equation 
k m i n « e, as both E a and J ba will be close to 1). The more likely restraint on k m i n comes from 
EquationEI as neither E b nor J ab have been selected for - we take mean values to get an estimate. E b is 
uniform on (0, 1) and thus has mean 0.5, but J ab must be positive for a steady state to exist, and because 
of its distribution 0], has a mean value 4 of around J ab = 0.2. We therefore find that the minimum value 
of k for a diverse state to exist is around fc m m = 2.5e, so k m i n — 5 in the situation studied in this paper 
(e = 2). 

This means there is a threshold for k below which no diversity can be found, and then increasing k 
allows for increasing diversity. We do not expect accuracy in the numbers here, as we have taken no 
account of fluctuations around the mean values, or stochasticity in the reproduction process. Still, the 
match to the observed values is good. 



4 Discussion 

In our model, we have found that there will be, in general, a cross-over to a state with coexistence of 
diverse types as the interaction increases. There is a critical value of interaction strength beyond which 
the monodominance is broken down. In future work, it will be important to understand the nature of 
the cross-over theoretically, beyond the naive estimate of the mean-field type calculation given in 13.21 

4 J has a mean of zero, but here we are taking the mean of the positive part of the distribution, which has non-zero 
mean. 



5 



Each evolutionary course can be different in the simulation. If the initial species has neighbours in 
genotype space that interact favourably with each other and negatively against the wildtype, then it 
will quickly go extinct and a number of q-ESS switches are observed. Other initial conditions allow the 
interactions of local mutants to favour the wildtype, and monodominance continues for a longer time, 
possibly beyond the timescale of the simulation. On a cross-over from one q-ESS state to another, our 
model predicts that at low interaction strengths, only cross-overs between monodominant states can 
occur. However, if the interaction strength k > e then all initial wildtypes should be able to diversify 
eventually, possibly through a route of alternate monodominant states. If k 3> e, the contribution to 
the weight function from the self-interaction becomes negligible and the system reduces to the original 
Tangled Nature model with the weight function Hq, meaning all states are diverse. In the experiment 
of E.coli 2 , switches to distinct populations of different types were observed through the course of 
evolution, which may correspond to q-ESS in the model. By repeating experiments, this diversification 
was reproducible, irrespectively of initial conditions, even though the evolution course itself differs by 
each experiment. These behaviours are similar to those observed in the simulation. 

We have shown theoretically that diversification can occur non-spatially when interactions of sufficient 
strength are introduced. The interaction strength k can be interpreted as a density in bacterial systems, 
as the total flow of chemicals between cells and the media increases, which leads to strong interaction 
among cells. Indeed, by increasing the population density, changes to a state with diverse types was 
observed 0. 

Our theoretical results enable us to probe the underlying factor allowing diversification that is unob- 
servable in real systems. Essentially we require both: 

1. The mean current co-interaction strength is equal to, or greater than, the self- interaction strength; 
i.e. if a new individual of a new species would gain from interactions with the average population 
more than another of the same species, then it will flourish, with coexistence even if it is not a 
better competitor on its own. The greater the ratio of the co-interaction to the self interaction, 
the less selection pressure acts via the species characteristics (the e term in Eq. |3J). 

2. The possible mutations from the wildtype reinforce themselves, or other species, more than the 
wildtype. 

In the experiment by Kashiwagi et al., a main source of interaction is glutamine. The glutamine 
synthesised by 'efficient' types of bacteria is leaked out to the medium, through which they interact with 
each other. When they experimentally cut off the interaction by adding glutaminasc in the medium to 
eliminate the leaked glutamine, only the fittest remains. Although this experiment clearly demonstrates 
the importance of the interaction, the degree of the interaction cannot be determined from the experi- 
ment. Our model result suggests that there is a critical value of interaction beyond which the coexistence 
appears. Although the necessity of dense population to show the coexistence in the experiment suggests 
such kind of critical interaction value, it will be important to obtain it explicitly in the experiment and 
to study the nature of the cross-over. Also, according to the theory, the degree of interaction for the 
coexistence must be of the same order as the internal process, as characterised by the ratio R (see Figure 
OJ). The requirement that k w 3e is due to the values of the mean of (positive couplings) < J >« 0.2 and 
< e >= 0.5. This suggests, for example, that the leaked glutamine from the "efficient" bacteria is of the 
same order of the amount of that synthesised within. In fact, in experiments ,3. with higher population 
of the bacteria with very low enzyme activity, high leakage of glutamine is observed. 

Phcnotypic plasticity will assist in the second condition if multiple expressions of the wildtype are 
possible. Then a mutant may gain by specialising into one of these expressions This plasticity is 
present in our model only in that an individual's fitness is a function of the frequency of active genotypes, 
which arc linked in a complex network |12| created by the interactions. Thus, plasticity may explain 
why the random interactions used in the TaNa model provide a good explanation as it may increase 
differences between separate genotypes. Indeed, k can be thought of in terms of phenotypic plasticity, as 
the interactions emphasise the differences between species, with all similarities being absorbed into the 
/iN term. Thus, the existence of k m i n implies that there is a similar threshold in the plasticity, below 
which only one expression of a species will be observed but above which, multiple expressions of the 
same species coexisting may be found. 

There is nothing in the model that refers explicitly to bacterial systems, and the effect described 
here should be observable in macroscopic, sexually reproducing species, assisted by phenotypic plasticity 
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|1 lj and mating preference Of course, directly measuring the interaction strength is more difficult 
in these systems, and the timescales involved makes macroscopic evolutionary experiments difficult. 
Models like this should assist in determining which aspects of microbial experiments can be extrapolated 
to macroscopic evolution. 

We have shown that at low interaction strengths, the evolutionary stable state is simply the most 
efficient competitor dominating. However, at higher interaction strengths, competition for resource is 
mediated by interspecial interaction and coexistence may occur between varying ability competitors. 
Evolution may be faster in this regime as interaction opens up new survival niches, which increase with 
the diversity and hence increase the available niches again. We have also been able to qualitatively 
predict the existence of the two regimes from the dynamics in the model. 
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